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Abstract 

Using specializations of unfold and fold on a generic tree 
data type we derive unranking and ranking functions provid- 
ing natural number encodings for various Hereditarily Finite 
datatypes. 

In this context, we interpret unranking operations as in- 
stances of a generic anamorphism and ranking operations as 
instances of the corresponding catamorphism. 

Starting with Ackerman's Encoding from Hereditarily 
Finite Sets to Natural Numbers we define pairings wd finite 
tuple encodings that provide building blocks for a theory of 
Hereditarily Finite Functions. 

The more difficult problem of ranking and unrank- 
ing Hereditarily Finite Permutations is then tackled using 
Lehmer codes and factoradics. 

The self-contained source code of the paper, as generated 



from a literate Haskell program, is available at http : // 
[logic ■ csci .unt . ed.u/teLrau/research/2008/f FUN.zip 



Keywords ranking/unranking, pairing/tupling functions, 
Ackermann encoding, hereditarily finite sets, hereditarily fi- 
nite functions, permutations and factoradics, computational 
mathematics, Haskell data representations 

1. Introduction 

This paper is an exploration with functional programming 
tools of ranking and unranking problems on finite functions 
and bijections and their related hereditarily finite universes. 
The ranking problem for a family of combinatorial objects 
is finding a unique natural number associated to it, called its 
rank. The inverse unranking problem consists of generating 
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a unique combinatorial object associated to each natural 
number. 

The paper is organized as follows: section |2] introduces 
generic ranking/unranking framework parameterized by bi- 
jective transformers and terminating conditions based on 
urelements, section [3]introduces Ackermann's encoding and 
its inverse as instances of the framework. After discussing 
some classic pairing functions, section |4] introduces new 
pairing/unpairing and tuple operations on natural numbers 
and uses them for encodings of finite functions (section |5]l, 
resulting in encodings for "Hereditarily Finite Functions" 
(section|6]l. Ranking/unranking of permutations and Heredi- 
tarily Finite Permutations as well as Lehmer codes and fac- 
toradics are covered in section |7] Sections [8] and |9] discuss 
related work, future work and conclusions. 

The paper is part of a larger effort to cover in a declara- 
tive programming paradigm, arguably more elegantly, some 
fundamental combinatorial generation algorithms along the 
lines of ( Knuth|2006 l. The practical expressiveness of func- 
tional programming languages (in particular Haskell) are put 
at test in the process. 

While the main focus of the paper was testdriving Haskell 
on the curvy tracks of non-trivial combinatorial generation 
problems, we have bumped, somewhat accidentally, into 
making a few new contributions to the field as such, that 
could be easily blamed on the quality of the vehicle we were 
testdriving: 



1. the three ranking/unranking algorithms from finite func- 
tions to natural numbers are new 

2. the universe of Hereditarily Finite Functions, as a func- 
tional analogue of the well known universe of Hereditar- 
ily Finite Sets is new 

3. the universe of Hereditarily Finite Permutations, as an 
analogue of the well known universe of Hereditarily Fi- 
nite Sets is new 
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4. the natural number tupling/untupling functions are new 



5. the ranking/unranking algorithm for permutations of ar- 
bitrary sizes is new (ahhough it is based on a known 
Lehmer code-based algorithm for permutations of fixed 
size) 

6. the catamorphism/anamorphism view of ranking/unrank- 
ing functions is new and it is likely to be reusable for 
various families of combinatorial generation problems 

Through the paper, we will use the following set of prim- 
itive arithmetic functions: 

double n = 2*n 
half n = n 'div' 2 
exp2 n = 2~n 

together with succ, pred, even, odd and sum Haskell func- 
tions, to emphasize that this subset is easily hardware im- 
plementable (by only using boolean operations, shifts and 
adders) and that these functions also have D(log n) or 
better software implementations for integers of (arbitrary) 
length n. 

When possible, we will use point-free notations (unnec- 
essary function arguments omitted) to emphasize the generic 
function composition dataflow. 

As we have put significant effort to ensure that all our 
types can be inferred, we will omit type declarations, with 
apologies to the type-curious reader, who can have them dis- 
played as needed, while interacting with the Haskell sources 
of the paper available online. 

2. Generic Unranking and Ranking with 
Higher Order Functions 

We will use, through the paper, a generic "rose tree" type T 
distinguishing between atoms tagged with A) and subforests 
(tagged with F). 

data T a = A a | F [T a] deriving (Eq, Ord, Read, Show) 

Atoms will be mapped to natural numbers in [0 . . ulimit-1] . 
When ulimit is fixed, we denote this set A. We denote Nat 
the set of natural numbers and T the set of trees of type T 
with atoms in A. 

The unranking operation is seen here as an instance of a 
generic anamorphism mechanism unfold, while the ranking 
operation is seen as an instance of the corresponding cata- 
morphism/oZii ( |Hutton| 19991 [Meijer ™d Hutton|1995[ ). 

Unranking As an adaptation of the unfold operation, natu- 
ral numbers will be mapped to elements of T with a generic 
higher order function unraxik. ulimit f, defined from 
Nat to T, parameterized by the the natural number ulimit 
and the transformer function f : 

unraiik_ ulimit _ n | (n<ulimit) && (n>0) = A n 
unrank_ ulimit f n | n > ulimit — 

(F (map (unrank_ ulimit f) (f (n-ulimit) ) ) ) 

A global constant def ault_ulimit will be used through 
the paper to fix the default range of atoms, allowing us to 
work with a default unrank function: 



def ault_ulimit — 

unrank = unrank_ def ault_ulimit 

Ranking Similarly, as an adaptation of fold, generic in- 
verse mappings rank_ ulimit and rank) from T to Nat 
are defined as: 

rank_ ulimit _ (A n) | (n<ulimit) && (n>0) = n 
rank_ ulimit g (F ts) = 

ulimit+(g (map (rank_ ulimit g) ts)) 

rank — rank_ def ault_ulimit 

Note that the guard in the second definition simply states 
correctness constraints ensuring that atoms belong to the 
same set A for rank, and unrank_. This ensures that the 
following holds: 

Proposition 1. If the transformer function f : Nat 
[Nat] is a bijection with inverse g, such that n > ulimit A 
/(n) — [no, ■■■rii, ...Hk] =^ rii < n, then unrsink is a 
bijection from Nat to T, with inverse rank and the recursive 
computations of both functions terminate in a finite number 
of steps. 

Proof: by induction on the structure of Nat and T, using 
the fact that map preserves bijections. 

Ranking functions can be traced back to Godel number- 
ings ( Godel|1931 Hartmanis and Baker|1974i ) associated to 
formulae. Together with their inverse unranking functions 
they are also used in combinatorial generation algorithms 
( [Martinez and Moline ro 2003; Knuth 2006). 

3. Hereditarily Finite Sets and Ackermann's 
Encoding 

While the Universe of Hereditarily Finite Sets is best known 
as a model of the Zermelo-Fraenkel Set theory with the Ax- 
iom of Infinity replaced by its negation (Takahashi '1 976[ 
|Meir et aL]|1983| ), it has been the object of renewed prac- 
tical interest in various fields, from representing structured 
data in databases (Leontjev and Sazonov 2000) to reasoning 
with sets and set constraints ( jPovier et al.|2000( |Piazza and| 
|Policriti|20()4l ). 

3.1 Ackermann's Encoding 

The Universe of Hereditarily Finite Sets is built from the 
empty set (or a set of Urelements) by successively applying 
powerset and set union operations. 

A surprising bijection, discovered by Wilhelm Acker- 
mann in 1937 (Ackermann 1937] [Abian and Lamacchia] 
|1978[ |Kaye and Wong,2007) maps Hereditarily Finite Sets 
(HFS) to Nafiiral Numbers (Nat): 

fix) = ifx = {} then else J2aex 2-^^°^ 



Assuming HFS extended with Urelements (objects not 
containing any elements) our generic "rose tree" represen- 
tation can be used for Hereditarily Finite Sets, with Urele- 
ments seen as atoms, i.e. Natural Numbers in [0 . . ulimit-1] 
The constructor A a marks Urelements of type a (usually the 
arbitrary length Integer type in Haskell) and the constructor 
F marks a list of recursively built HFS type elements. Note 
that if no elements are used with the A constructor, we obtain 
the "pure" HFS universe with everything built out from the 
empty set represented as F [] . 

Let us note that Ackermann's encoding can be seen as the 
recursive application of a bijection set2nat from finite sub- 
sets of Nat to Nat, that associates to a set of (distinct!) nat- 
ural numbers a (unique!) natural number With this represen- 
tation, Ackermann's encoding from HFS to Nat hf s2nat 
can be expressed in terms of our generic rank function as: 

hfs2nat — rank set2nat 

set2nat ns — sum (map exp2 ns) 

To obtain the inverse of the Ackerman encoding, let's first 
define the inverse nat2set of the bijection set2nat. It de- 
composes a natural number into a list of exponents of 2 (seen 
as bit positions equaling 1 in its bitstring representation, in 
increasing order). 

nat2set n = nat2exps n where 
nat2exps _ = [] 
nat2exps n x = 

if (even n) then xs else (x:xs) where 
xs=nat2exps (half n) (succ x) 

The inverse of the Ackermann encoding, with urelements in 
[0. .def ault_ulimit-l] and the empty set mapped to F 
[] is defined as follows: 

nat2hf s — unrank nat2set 

This definition is motivated by the fact that nat2hf s and 
hfs2nat are obtained through recursive compositions of 
nat2set and set2iiat, respectively. Generalizing the en- 
coding mechanism to use other bijections with similar prop- 
erties, naturally leads to the anamorphism/catamorphism 
view of unrank/rank. 

The following proposition summarizes the results in this 
subsection: 

Proposition 2. Given id = Xx.x, the following function 
equivalences hold: 

nat2set o set2nat = id (1) 

set2nat o nat2set = id (2) 

nat2hfs o hfs2nat = id (3) 

hfs2nat o nat2hfs = id (4) 



3.2 Combinatorial Generation as Iteration 

Using the inverse of Ackermann's encoding, the infinite 
stream HFS can be generated simply by iterating over the 
infinite stream [0 . . ] : 

iterative_hf s_generator=map nat2hfs [0..] 

take 5 iterative_hf s_generator 
[F [],F [F []],F [F [F []]], 

F [F [],F [F []]],F [F [F [F []]]]] 

One can try out nat2hf s and its inverse hf s2nat and 
print out a canonical string representation of HFS with the 
setShow functions given in Appendix: 

nat2hfs 42 

F [F [F []],F [F [],F [F []]], 
F [F [],F [F [F []]]]] 
hfs2nat (nat2hfs 42) 

42 

setShow 42 

" {{{}} , {{} , {{»} , {{} , {{{}}»} " 

Note that setShow n will build a string representation of 
n G Nat, implicitly "deforested" as a HFS with Urele- 
ments in [0. . def ault_ulimit-l] . Figure [T] shows the di- 
rected acyclic graph obtained by merging shared nodes in the 
rose tree representation of the HFS associated to a natural 
number (with arrows pointing from sets to their elements). 




Figure 1 : Hereditarily Finite Set associated to 42 

4. Pairing Functions and Tuple Encodings 

Pairings are bijective functions Nat x Nat Nat. Fol- 
lowing the classic notation for pairings of ( |Robinson|1950| l, 
given the pairing function J, its left and right inverses K and 
L are such that 



JiK{z),L{z))^z 



K{J{x,y))=x 



L{J{x,y)) = y 



(5) 



(6) 



(7) 



We refer to ("Cegielski and Richard 200 l| l for a typical use 
in the foundations of mathematics and to (| Rosenberg|2002| l 
for an extensive study of various pairing functions and their 
computational properties. We will start by overviewing two 
classic pairing functions. 

4.1 Cantor's Pairing Function 

Cantor's geometrically inspired pairing function (also present 
in earlier work by Cauchy) is defined as: 

nat_cpair x y = (x+y) * (x+y+1) 'div' 2+y 

As the following example shows, it grows symmetrically in 
both arguments: 

[nat_cpair i j I i<-[0. .3] , j<-[0. .3]] 

[0,2,5,9,1,4,8,13,3,7,12,18,6,11,17,24] 

4.2 The Pepis-Kalmar- Robinson Pairing Function 

An interesting pairing function asymmetrically growing, 
faster on the first argument, is the function pepis J and its 
left and right unpairing companions pepis_K and pepis_L 
that have been used, by Pepis, Kalmar and Robinson to- 
gether with Cantor's functions, in some fundamental work 
on recursion theory, decidability and Hilbert's Tenth Prob- 
lemJnJPpIs] fT938l |Kalmar] fT939l |Kalmar, Laszlo andj 
Suranyi, Ja nos 1947, 1950; Robinson 1950 1955 1968a b'^ 
19671). The function pepisj combines two numbers re- 
versibly by multiplying a power of 2 derived from the first 
and an odd number derived from the second: 

/(x,y) = 2-*(2*j/ + l)-l (8) 

Its Haskell implementation, together with its inverse is: 
pepis_J X y = pred ((exp2 x)*(succ (double y))) 

pepis_K n = two_s (succ n) 

pepis_L n = half (pred (no_two_s (succ n))) 

two_s n I even n = succ (two_s (half n) ) 
two_s _ = 

no_two_s n = n 'div' (exp2 (two_s n) ) 

This pairing function (slower in the second argument) works 
as follows: 

pepis_J 1 10 
41 

pepis_J 10 1 
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[pepis_J i j |i<-[0. .3] ,j<-[0. .3]] 

[0,2,4,6,1,5,9,13,3,11,19,27,7,23,39,55] 

As Haskell provides a built-in ordered pair, it is convenient 
to regroup J , K , L as mappings to/from built-in ordered 
pairs: 

haskell2pepis (x,y) = pepis_J x y 
pepis2haskell n — (pepis_K n,pepis_L n) 

4.3 The BitMerge Pairing Function 

We will introduce here an unusually simple pairing function 
(that we have found out recently as being the same as the 
one in defined in Steven Pigeon's PhD thesis on Data Com- 
pression ( Pigeon|2001| , page 1 14). It provides compact rep- 
resentations for various constructs involving ordered pairs. 

The bijection bitmerge_pair from Nat x Nat to Nat 
and its inverse bitmerge.unpair are defined as follows: 

bitmerge_pair (i,j) = 

set2nat ((evens i) 4-f (odds j)) where 
evens x = map double (nat2set x) 
odds y — map succ (evens y) 

bitmerge_unpair n = (f xs,f ys) where 
(xs.ys) = partition even (nat2set n) 
f = set2nat . (map half) 

The function bitmerge_pair works by splitting a number's 
big endian bitstring representation into odd and even bits 
while its inverse bitmerge_unpair blends the odd and even 
bits back together. With help of the function to_rbits given 
in Appendix, that decomposes n E Nat into a list of bits 
(smaller units first) one can follow what happens, step by 
step: 

to_rbits 2008 

[0,0,0,1, 1,0,1,1, 1,1,1] 
bitmerge_unpair 2008 

(60,26) 
to_rbits 60 

[0,0, 1,1, 1,1] 
to_rbits 26 

[0,1, 0,1, 1] 
bitmerge_pair (60,26) 

2008 

Proposition 3. The following function equivalences hold: 



bitmergc-pair o bitmergejunpair = id 



(9) 



bitmergejunpair o bitmerge_pair = id (10) 

4.4 Tuple Encodings as GeneraUzed BitMerge 

We will now generalize this pairing function to /c-tuples and 
then we will derive an encoding for finite functions. 



The function to_tuple : Nat Nat^ converts a natu- 
ral number to a A; -tuple by splitting its bit representation into 
k groups, from which the k members in the tuple are finally 
rebuilt. This operation can be seen as a transposition of a bit 
matrix obtained by expanding the number in base 2^: 

to_tuple k n = map from_rbits ( 
transpose ( 

map (to_maxbits k) ( 
to_base (exp2 k) n 

) 

) 

) 

To convert a fc-tuple back to a natural number we will merge 
their bits, fc at a time. This operation uses the transposition 
of a bit matrix obtained from the tuple, seen as a number 
in base 2*', with help from bit crunching functions given in 
Appendix: 

from_tuple ns = from_base (exp2 k) ( 
map from_rbits ( 
transpose ( 

map (to_maxbits 1) ns 

) 

) 

) where 

fc^genericLength ns 
l=max_bitcount ns 

The following example shows the decoding of 42, its decom- 
position in bits (right to left), the formation of a 3-tuple and 
the encoding of the tuple back to 42. 

to_rbits 42 

[0,1,0, 1,0,1] 
to_tuple 3 42 

[2,1,2] 
to_rbits 2 

[0,1] 
to_rbits 1 

[1] 

from_tuple [2,1,2] 
42 

Fig. |2] shows multiple steps of the same decomposition, 
with shared nodes collected in a DAG. Note that cylinders 
represent markers on edges indicating argument positions, 
the cubes indicate leaf vertices (0,1) and the small pyramid 
indicates the root where the expansion has started. 

The following proposition states that this tupling function 
is a generalization of bitmerge_pair 

Proposition 4. The following function equivalences hold: 

bitmergejunpair n = toJ-uple 2 n (11) 
bitmerge-pair {x,y) = fromjtuple [x,y] (12) 




Figure 2: Repeated 3-tuple expansions: 42 and 2008 

5. Encoding Finite Functions 

As finite sets can be put in a bijection with an initial segment 
of Nat we can narrow down the concept of finite function as 
follows: 

Definition 1. A finite function is a function defined 
from an initial segment of Nat to Nat. 

This definition implies that a finite function can be seen 
as an array or a list of natural numbers except that we do not 
limit the size of the representation of its values. 

5.1 Encoding Finite Functions as Tuples 

We can now encode and decode a finite function from [0..fc— 
1] to Nat (seen as the list of its values), as a natural number: 

ftuple2nat [] = 

ftuple2nat ns = haskell2pepis (pred k,t) where 
keener icLength ns 



t=^rom_tuple ns 
nat2ftuple = [] 

nat2ftuple kf = to_tuple (succ k) f where 
(k,f )=pepis2haskell kf 

As the length of the tuple, k, is usually smaller than the 
number obtained by merging the bits of the k-tuple, we have 
picked the Pepis pairing function, exponential in its first 
argument and linear in its second, to embed the length of 
the tuple needed for the decoding. The encoding/decoding 
works as follows: 

ftuple2nat [1,0,2,1,3] 

21295 
nat2ftuple 21295 

[1,0,2,1,3] 
map nat2f tuple [0..15] 

[[] , [0,0] , [1] , [0,0,0] , [2] , [1,0] , [3] , 

[0,0,0,0] , [4] , [0,1] , [5] , [1,0,0] , [6] , 

[1,1] , [7] , [0,0,0,0,0]] 

Note that 

map nat2f tuple [0..] 

provides an iterative generator for the stream of finite func- 
tions. 

5.2 Deriving an Encoding of Finite Functions from 
Ackermann's Encoding 

Given that a finite set with n elements can be put in a 
bijection with [0..n-l], a finite functions / : [0..n — 1] ^ 
Nat can be represented as the list [/(0).../(n — 1)]. Such 
a list has however repeated elements. So how can we turn it 
into a set with distinct elements, bijectively? 

The following two functions provide the answer 
First, we just sum up the list of the values of the function 
with scanl, resulting in a monotonically growing sequence 
(provided that we first increment every number by 1 to 
ensure that values do not break monotonicity). 

fun2set ns — 

map pred (tail (scanl (+) (map succ ns))) 

The inverse function reverting back from a set of distinct 
values collects the increments from a term to the next (and 
ignores the last one): 

set2fun ns — map pred (genericTake 1 ys) where 
l=genericLength ns 
xs =(map succ ns) 
ys=(zipWith (-) (xs-H-[0] ) (0:xs)) 



The following example shows the conversion and its in- 
verse. 

fun2set [1,0,2,1,2] 

[1,2,5,7,10] 
set2fuii [1,2,5,7,10] 

[1,0,2,1,2] 

By combining this with Ackermann encoding's basic step 
set2nat and its inverse nat2set, we obtain an encoding 
from finite functions to Nat follows: 

nat2fun — set2fun . nat2set 
fun2nat — set2nat . fun2set 

nat2fuii 2008 

[3,0,1,0,0,0,0] 
f un2nat [3,0,1,0,0,0,0] 

2008 

Proposition 6. The following function equivalences hold: 
nat2fun o fun2nat = id (15) 



fun2nat o nat2fun = id 



(16) 



One can see that this encoding ignores Os in the binary 
representation of a number, while counting 1 sequences as 



increments. Run Length Encoding of binary sequences ( Mki- 



nen and Navarro 2005| l encodes Os and Is symmetrically, 
by counting the numbers of Is and Os. This encoding is re- 
versible, knowing that Is and Os alternate, and that the most 
significant digit is always 1: 

bits2rle [] = [] 
bits2rle [_] = [0] 

bits2rle (x:y:xs) | x==y = (c+l):cs where 

(c : cs)=bits2rle (y:xs) 
bits2rle (_:xs) =0:(bits2rle xs) 

rle2bits [] = [] 
rle2bits (n:ns) — 

(genericReplicate (nfl) b) 4-f xs where 
xs=rle2bits ns 

b=if []==xs then 1 else l-(head xs) 

By composing them with converters to/from bitlists, we ob- 
tain the bijection nat2rle : Nat — > [Nat] and its inverse 
rle2nat : [Nat] -> Nat 

nat2rle — bits2rle . to_rbitsO 



Proposition 5. The following function equivalences hold: 
nat2set o set2nat = id (13) 



rle2nat = from rbits 



rle2bits 



to_rbitsO = [] 
to_rbitsO n = to_rbits n 



set2nat o nat2set = id 



(14) 



Proposition 7. The following function equivalences hold: 



nat2rle o rle2nat = id (17) 
rle2nat o nat2rle = id (18) 

6. Encodings for "Hereditarily Finite 
Functions" 

One can now build a theory of "Hereditarily Finite Func- 
tions" (HFF) centered around using a transformer like 
nat2f tuple, nat2f un, nat2rle and its inverse f tuple2nat, 
f un2nat, rle2iiat in way similar to the use of nat2set 
and set2nat for HFS, where the empty function (de- 
noted F []) replaces the empty set as the quintessential 
" urf unction " . Similarly to Urelements in the HFS theory, 
"urfanctions" (considered here as atomic values) can be in- 
troduced as constant functions parameterized to belong to 
[0..ulimit — 1]. 

By using the generic rank function defined in section |2] 
we can extend the bijections defined in this section to encod- 
ings of Hereditarily Finite Functions. By instantiating the 
transformer function in unrank to nat2f tuple, iiat2fun 
and nat2rle we obtain: 

nat2hff — unrank nat2fun 
nat2hffl = unrank nat2f tuple 
nat2hff2 = unrank nat2rle 

By instantiating the transformer function in rank we ob- 
tain: 

hff2nat = rank fun2nat 
hff2natl = rank ftuple2nat 
hff2nat2 = rank rle2nat 

The following examples show that nat2hff, nat2hff 1 
and nat2hf f 2 are indeed bijections, and that the resulting 
HFF-tiees are typically more compact than the HFS-tiee 
associated to the same natural number. 

F [] 
nat2hff 1 

F [F []] 
nat2hffl 

F [] 
nat2hffl 1 

F [F [],F []] 
nat2hff2 

F [] 
nat2hff2 1 

F [F []] 

nat2hff 42 

F [F [F []],F [F []],F [F []]] 
nat2hffl 42 

F [F [F [F [],F [],F []],F [] ] ] 
nat2hff2 42 



F [F [],F [],F [],F [],F [],F []] 
nat2hfs 42 

F [F [F []],F [F [],F [F [] ] ] , 

F [F [],F [F [F []]]]] 
F [F [F []],F [F [],F [F [] ] ] , 

F [F [],F [F [F []]]]] 

nat2hff 12345 

F [F [],F [F [F []]],F [], 

F [],F [F [F []],F []],F []] 
nat2hffl 12345 

F [F [F [F [F [F [],F [] ] ] , 

F []]],F [F [],F [],F [F [],F []]]] 
nat2hff2 12345 

F [F [],F [F []],F [F [],F [] ] , 
F [F [],F [],F []],F [F []]] 

hff2nat (nat2hff 12345) 
12345 

hff2natl (nat2hffl 12345) 
12345 

hff2nat2 (nat2hff2 12345) 
12345 

Note that map nat2hff [0..], nat2hffl [0..], 
nat2hffl [0. .] provide iterative generators for the (re- 
cursively enumerable!) stream of hereditarily finite func- 
tions. 

The resulting HFF with urfunctions (seen as digits) can 
also be used as generalized numeral systems with applica- 
tions to building arbitrary length integer implementations. 
Assuming def ault_ulimit=10 we obtain: 

nat2hff 1234567890 

F [A 3, A 2, A 0,A 1,A 7, 
A 0,A 1,A 2, A 0,A 2, A 2 

] 

nat2hffl 1234567890 

F [F [F [F [F [A 0,A 3]] , 

F [F [F [A 2, A 0,A 1]]],A 1]] 

] 

nat2hff2 1234567890 

F [A 2, A 0,A 1,A 1,A 0,A 0,A 6, A 1, 
A 0,A 0,A 1,A 1,A 1,A 0,A 1,A 

] 

which display with the funShow functions given in Ap- 
pendix as: 

funShow 1234567890 

"(32017012022)" 
funShowl 1234567890 

"(((((0 3)) (((2 1))) 1)))" 
funShow2 1234567890 

"(2 011006100111010)" 

Proposition 8. The following function equivalences hold: 



nat2hffl o hff2natl = id 



hf f2natl o nat2hffl = id 



nat2hff o hf f2nat — id 



hf f2nat o nat2hff — id 



(19) 



(20) 



(21) 



(22) 



7. Encoding Finite Bijections 

To obtain an encoding for finite bijections (permutations) we 
will first review a ranking/unranking mechanism for permu- 
tations that involves an unconventional numeric representa- 
tion, factoradics. 

7.1 The Factoradic Numeral System 

The factoradic numeral system ( |Knuth|1997 1 replaces digits 



multiplied by power of a base with digits that multiply 
successive values of the factorial of N. In the increasing 
order variant f r the first digit do is 0, the second is di E 
{0, 1} and the A^-th is e [0..iV - 1]. The left-to-right, 
decreasing order variant f 1 is obtained by reversing the 
digits of f r. 

fr 42 

[0,0,0,3,1] 
rf [0,0,0,3,1] 

42 
fl 42 

[1,3,0,0,0] 
If [1,3,0,0,0] 

42 

The function f r handles the special case for and calls f rl 
which recurses and divides with increasing values of N while 
collecting digits with mod: 

— factoradics of n, right to left 
fr = [0] 
fr n = f In where 
f _ = [] 

f j k = (k 'mod' j) : 

(f (j+1) (k 'div' j)) 

The function fl, with digits left to right is obtained as 
follows: 

fl — reverse . fr 

The function If (inverse of f 1) converts back to decimals 
by summing up results while computing the factorial pro- 
gressively: 

rf ns — sum (zipWith (*) ns factorials) where 
f actorials=scanl (*) 1 [1..] 

Finally, If, the inverse of f 1 is obtained as: 
If = rf . reverse 



7.2 Ranking and unranking permutations of given size 
witli Lelimer codes and factoradics 

The Lehmer code of a permutation / is defined as the num- 
ber of indices j such that I < j < i and f{j) < f{i) 
( |Mantaci and Rakotondrajao|2001| l. 

Proposition 9. The Lehmer code of a permutation deter- 
mines the permutation uniquely. 

The function perm2nth computes a rank for a permutation 
ps of size>0. It starts by first computing its Lehmer code 
Is with perm21ehmer. Then it associates a unique natural 
number n to Is, by converting it with the function If from 
factoradics to decimals. Note that the Lehmer code Ls is 
used as the list of digits in the factoradic representation. 

perm2nth ps = (l,lf Is) where 
ls=perm21ehmer ps 
l=genericLength Is 

perm21ehmer [] — [] 

perm21ehmer (i:is) = 1 : (perm21ehmer is) where 
l=genericLength [j|j^is,j<i] 

The function nat2perm provides the matching unranking 
operation associating a permutation ps to a given size>0 
and a natural number n. 

— generates n-th permutation of given size 
nth2perm (size.n) — 

apply_lehmer2perm (zs+4-xs) [0..size-l] where 
xs=^l n 

l=genericLength xs 
k=size-l 

zs=genericReplicate k 

— converts Lehmer code to permutation 
lehmer2perm xs = apply_lehmer2perm xs is where 

is=[0 . . (genericLength xs)-l] 

— extracts permutation from factoradic "digit" list 
apply_lehmer2perm [] [] = [] 
apply_lehmer2perm (n:ns) ps@(x:xs) = 

y : (apply_lehmer2perm ns ys) where 
(y,ys) — pick n ps 

pick i xs = (x.ys+fzs) where 

(ys,(x:zs)) = genericSplitAt i xs 

Note also that lehmer2perm is used this time to reconstruct 
the permutation ps from its Lehmer code, which in turn is 
computed from the permutation's factoradic representation. 
One can try out this bijective mapping as follows: 

nth2perm (5,42) 

[1,4,0,2,3] 
perm2nth [1,4,0,2,3] 

(5,42) 
nth2perm (8,2008) 

[0,3,6,5,4,7,1,2] 



perm2nth [0,3,6,5,4,7,1,2] 
(8,2008) 

7.3 A bijective mapping from permutations to Nat 

One more step is needed to to extend the mapping be- 
tween permutations of a given length to a bijective mapping 
from/to Nat: we will have to "shift towards infinity" the 
starting point of each new bloc of permutations in Nat as 
permutations of larger and larger sizes are enumerated. 

First, we need to know by how much - so we compute the 
sum of all factorials up to nl. 

— fast computation of the sum of all factorials up to 
sf n = rf (genericReplicate n 1) 

This is done by noticing that the factoradic representation of 
[0,1,1,..] does just that. The stream of all such sums can now 
be generated as usual: 

sf s = map sf [0 . . ] 

What we are really interested into, is decomposing n into the 
distance to the last sum of factorials smaller than n, n_m and 
the its index in the sum, k. 

to_sf n = (k,n-m) where 

k=pred (head [x | [0 . . ] , sf x>n] ) 
m=sf k 

Unranking of an arbitrary permutation is now easy - the in- 
dex k determines the size of the permutation and n-m deter- 
mines the rank. Together they select the right permutation 
with nth2perm. 

nat2perm = [] 

nat2perm n — nth2perm (to_sf n) 

Ranking of a permutation is even easier: we first compute 
its size and its rank, then we shift the rank by the sum of 
all factorials up to its size, enumerating the ranks previously 
assigned. 

perm2nat ps = (sf l)4k where 
(l,k) — perm2nth ps 

nat2perm 2008 

[1,4,3,2,0,5,6] 
perm2nat [1,4,3,2,0,5,6] 

2008 

As finite bijections are faithfully represented by permuta- 
tions, this construction provides a bijection from Nat to the 
set of Finite Bijections. 

Proposition 10. The following function equivalences hold: 

nat2perm o perm2nat = id = perm2nat o nat2perm 

(23) 

The stream of all finite permutations can now be generated 
as usual: 

perms — map nat2perm [0. .] 



7.4 Hereditarily Finite Permutations 

By using the generic unrank and rank functions defined 
in section|2]we can extend the nat2perm and perm2nat to 
encodings of Hereditarily Finite Permutations (HFP). 



nat2hf p 
hfp2nat 



unrank nat2perm 
rank perm2nat 



n! 



The encoding works as follows: 

nat2hfp 42 

F [F [],F [F [],F [F []]], 
F [F [F []],F []],F [F []], 
F [F [],F [F []],F [F [],F [F []]]]] 
hfp2nat it 
42 

Assuming def ault_ulimit=10 and using the string repre- 
sentation provided by permShow (Appendix) we obtain: 

nat2hfp 42 

F [F [] ,A 2, A 3, A 1,A 4] 
permShow 42 

"(0231 4)" 
permShow 1234567890 

"(1 6 (0 1 3 2) 2 3 (0 1 2 3) 
7 8 5 9 4 (0 2 1 3))" 

Proposition 1 1 . The following function equivalences hold: 
nat2hfp o hfp2nat = id = hfp2nat o nat2hfp (24) 

8. Related work 

Natural Number encodings of Hereditarily Finite Sets have 
triggered the interest of researchers in fields ranging from 
Axiomatic Set Theory and Foundations of Logic to Com- 



plexity Theory and Combinatorics (Takahashi 1976[ Kaye 



and Wong 2007 : 'Kirby"'2007'i Abian and Lamacchia"1978 
Booth_1990,.Meir et al.,1983, Leontjev and Sazonov,2000 



Sazonov|1993 Avigad|1997 1. Computational and Data Rep- 
resentation aspects of Finite Set Theory have been de- 
scribed in logic programming and theorem proving contexts 
in Povier et aL]|2000l [Piazza and Policriti|[2004i [Paulson] 
1994| l. Pairing functions have been used work on decision 



problems as early as ( Pepis|[T938; Kalmar||1939[ Robinson 



|1950| |1968b| l. The tuple functions we have used to encode 
finite functions are new. While finite functions have been 
used extensively in various branches of mathematics and 
computer science, we have not seen any formalization of 
hereditarily Finite Functions or Hereditarily Finite Bijec- 
tions as such in the literature. 

9. Conclusion and Future Work 

We have shown the expressiveness of Haskell as a meta- 
language for executable mathematics, by describing natural 
number encodings, tupling/untupling and ranking/unranking 
functions for finite sets, functions and permutations and by 



extending them in a generic way to Hereditarily Finite Sets, 
Hereditarily Finite Functions and Hereditarily Finite Permu- 
tations. 



In a Genetic Programming context (Koza 1992 PoU 



et al. I, the bijections between bitvectors/natural numbers on 
one side, and trees/graphs representing HFSs, HFFs, HPPs 
on the other side, suggest exploring the mapping and its 
action on various transformations as a phenotype-genotype 
connection. 

We also foresee interesting applications in cryptography 
and steganography. For instance, in the case of the permuta- 
tion related encodings - something as simple as the order of 
the cities visited or the order of names on a greetings card, 
seen as a permutation with respect to their alphabetic order, 
can provide a steganographic encoding/decoding of a secret 
message by using functions like nat2perni and perm2nat. 
It looks like an interesting topic to investigate if higher den- 
sity and more random looking steganographic loads could be 
incorporated on top of Hereditarily Finite Permutations. 
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A. Appendix 

To make the code in the paper fully self contained, we list 
here some auxiliary functions. 

String Representations The functions s et Show and f unShow 
provide a string representation of a natural number as a 
"pure" HFS or HFF. They are obtained as instances of gshow 
which provides a generic template parameterized with syn- 
tactic elements. 

setShow = (gshow "{" "," "}") . nat2hfs 
funShow = (gshow "(" " " ")") . nat2hff 
funShowl = (gshow "(' ")") . nat2hffl 
fuiiShow2 = (gshow "( ')") . nat2hff2 
permShow = (gshow "( ')") . nat2hfp 

gshow _ _ _ (A n) = show n 
gshow 1 _ r (F [] ) = 

— empty function shown as rather than () 

if def ault_ulimit > 1 then "0" else 14-|-r 
gshow 1 c r (F ns) — 1-H- 

foldl (-H-) "" 

(intersperse c (map (gshow 1 c r) ns)) 

+fr 



max_bitcount computes the maximum bitcount for a list of 
integers. 

bitcount n — head [x|x^[l. .] , (exp2 x)>n] 
max_bitcount ns — foldl max (map bitcount ns) 

The following functions implement conversion opera- 
tions between bitlists and numbers. Note that our bitlists 
represent binary numbers by selecting exponents of 2 in in- 
creasing order (i.e. "right to left"). 

— from decimals to binary as list of bits 
to_rbits n — to_base 2 n 

— from bits to decimals 
from_rbits bs = from_base 2 bs 

— to binary, padded with Os , up to maxbits 
to_maxbits maxbits n — 

bs -H- (genericTake (maxbits-1)) (repeat 0) where 
bs=to_base 2 n 
l=genericLength bs 

— conversion to base n, as list of digits 
to_base base n = d : 

(if q=0 then [] else (to_base base q) ) where 
(q,d) — quotRem n base 

— conversion from any base to decimal 
from_base base [] = 

from_base base (x:xs) = x-fbase* (f rom_base base xs) 



Bit crunching functions The function bitcount computes 
the number of bits needed to represent an integer and 



